Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion
نویسندگان
چکیده
منابع مشابه
Guided Optimization for Balanced Locomotion
Teaching simulated creatures how to walk and run is a challenging problem. As with a baby learning to walk, however, the task of synthesizing the necessary muscle control is simplified if an external hand to assist in maintaining balance is provided. A method of using guiding forces to allow progressive learning of control actions for balanced locomotion is presented. The process has three stag...
متن کاملLocomotion Gait Optimization for a Quadruped Robot
Legged robot gait generation is a challenging task that involves the control of a large number of degrees of freedom (DOF’s) within a mechanical structure that varies during locomotion. A large number of motion parameters have to be considered in order to obtain a stable, natural and efficient locomotion. Legged robot locomotion applies nonlinear dynamical equations of high order with a multidi...
متن کاملHigh speed locomotion for a quadrupedal microrobot
Research over the past several decades has elucidated some of the mechanisms behind high speed, highly efficient and robust locomotion in insects such as cockroaches. Roboticists have used this information to create biologically-inspired machines capable of running, jumping, and climbing robustly over a variety of terrains. To date, little work has been done to develop an at-scale insect-inspir...
متن کاملMachine Learning for Fast Quadrupedal Locomotion
For a robot, the ability to get from one place to another is one of the most basic skills. However, locomotion on legged robots is a challenging multidimensional control problem. This paper presents a machine learning approach to legged locomotion, with all training done on the physical robots. The main contributions are a specification of our fully automated learning environment and a detailed...
متن کاملConstrained Policy Optimization
For many applications of reinforcement learning it can be more convenient to specify both a reward function and constraints, rather than trying to design behavior through the reward function. For example, systems that physically interact with or around humans should satisfy safety constraints. Recent advances in policy search algorithms (Mnih et al., 2016; Schulman et al., 2015; Lillicrap et al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Robotics and Automation Letters
سال: 2020
ISSN: 2377-3766,2377-3774
DOI: 10.1109/lra.2020.2979656